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1 Introduction 

In this paper we generalize the well-known construction of shuffle product algebras 
by using mixable shuffles, and prove that any free Baxter algebra is isomorphic to a 
mixable shuffle product algebra. This gives an explicit construction of the free Baxter 
algebra, extending the work of Rota [T5] and Cartier [2] . 

In an important paper published in 1958, Ree ^2] constructed algebras in which 
the product is expressed in terms of shuffles. He was motivated by Chen's work on 
iterated integrals of paths [2] , where this shuffle product is derived from the integration 
by parts formula 

f(t)dt [ g(t)dt= [ /(*)(/ g(s)ds)dt+ [ g{t){( f{s)ds)dt. (1) 
'o Jo Jo Jo Jo Jo 

Subsequently, shuffle product constructions have been studied extensively and have 
found applications in many areas of pure and applied mathematics. 

In another important paper published in 1960, Baxter P considered operators P 
that satisfies the identity 

P(x)P(y) + P(xy) = P(xP(y)) + P(yP(x)) 

and used this identity to study the theory of fluctuations. Rota studied Baxter's 
operators from an algebraic point of view and defined a Baxter algebra to be an algebra 
A with an operator P satisfying the identity 

P(x)P(y) + qP(xy) = P{xP(y)) + P(yP(x)) 
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for some fixed q in the base ring of A. Rota ^2] and Cartier [2] gave explicit construc- 
tions of the free Baxter algebra on a set X in the case when q = 1. 

It is easy to see that the product in Baxter algebras when q = can be described 
by shuffle products, and the shuffle product algebras considered by Chen and Ree 
have a canonical Baxter operator. However, there does not appear to be an explicit 
and systematic study of this connection between Baxter algebras and shuffle products 
except a remark in a recent paper of Rota 18. implying such a connection. 

The motivation for the current paper came from the desire of developing a theory 
that is "dual" to the beautiful theory of differential algebras obtained by Ritt [TH] and 
Kolchin Since a differential algebra is an algebra with an operator that satisfies 
the Leibniz product rule, it seems natural to study "integration algebras", i.e., algebras 
with an operator that satisfies an identity similar to the one in equation (JTJ. Of course, 
these are just Baxter algebras where q = 0. Unaware of the above mentioned work on 
Baxter algebras, we gave a description of the free integration algebra by using shuffle 
products. 

It was Rota who pointed out to us the earlier work on free Baxter algebras and 
suggested that we extend our shuffle product description of free integration algebras to 
Baxter algebras. This is carried out in this paper, by making use of a modified shuffle 
product, called the mixable shuffle product. Thus we not only consider shuffles of two 
vectors, but also shuffles in which certain components of a shuffle will "merge". This 
enables us to construct the mixable shuffle product algebras, generalizing the classical 
construction of shuffle product algebras, and to give a more intuitive and constructive 
description of the free Baxter algebra. Also, our description is of free Baxter algebras 
on any commutative algebra with any value q. Furthermore, the free Baxter algebra of 
Cartier or Rota is in the category of Baxter algebras not necessarily having an identity, 
while the free Baxter algebra we consider is in the category of Baxter algebras with an 
identity. When specialized to the case considered by Rota or Cartier, the free Baxter 
algebra we construct contains their free Baxter algebra as a sub-Baxter algebra in the 
category of Baxter algebras not necessarily having an identity. 

The mixable shuffle product of Baxter algebras allows us to study in more detail the 
properties of Baxter algebras. This will be the subject of a forthcoming paper. Shuffle 
products have occurred in many other fields and contexts, such as Hopf algebras, 
algebraic if-theory, algebraic topology and combinatorics, as well as in computational 
mathematics and applied mathematics; see, for example, jH |SJ H2 UH UH| • One 
might hope that the mixable shuffle introduced here will provide interesting and useful 
generalizations to those theories. One might also hope that the mixable shuffle product 
will be useful for the applications of Baxter algebras, such as those considered by Baxter 
and Rota. 

In this paper, we first give a brief summary of basic definitions and properties of 
Baxter algebras in section |21 We then make a careful study of mixable shuffles in 
section |3J In section 0] we apply properties of mixable shuffles to construct mixable 
shuffle product algebras and to prove that any free Baxter algebra is isomorphic to such 
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an algebra. In this section, we also briefly consider variations of the free construction 
by examining the free Baxter algebra on a set, on a commutative monoid and on 
a module. The relation between the free Baxter algebras we construct and the free 
Baxter algebras constructed by Rota and Cartier is described in sectional We conclude 
by considering the special case of the free Baxter algebra on the empty set. 

2 Definitions and basic properties 

In this paper, any ring R is commutative with identity element 1r. All notation will 
be standard unless otherwise noted. In particular, we write N for the additive monoid 
of natural numbers {0, 1,2,.. .} and N + = {n G N | n > 0} for the positive integers. 
Also, (™) will denote the usual binomial coefficient defined for any n, k G N with k < n 
by ( n k ) = n\/(k\(n - k)\). 

Definition 2.1. Let C be a ring, q G C , and let R be a C-algebra. A Baxter operator 

on R over C is a C -module endomorphism P of R satisfying 

P(x)P(y) + qP(xy) = P(xP(y)) + P{yP{x)), x, y G R. (2) 

We will find it convenient for the remainder of this paper to write equation (J2J) in 
the form 

P(x)P(y) = P{xP{y)) + P(yP(x)) + \P(xy), (3) 
so that our A is —q. We will also say that P has weight A. 

Definition 2.2. A Baxter C-algebra of weight X is a pair (R, P) where R is a 
C-algebra and P is a Baxter operator of weight A on R over C . 

If the meaning of A is clear, we will suppress A from the notation. 

Note that the mapping : R — > R defined by 0(r) = for all r G R is trivially a 
Baxter operator on R over R, for any ring R. Hence every C-algebra can be viewed as a 
Baxter C-algebra. Definitions of basic concepts for C-algebras can be similarly defined 
for Baxter C-algebras. In particular, let (R, P) and (S, Q) be two Baxter C-algebras 
of weight A. A homomorphism of Baxter C-algebras / : (R, P) — > (S, Q) is a 
homomorphism / : R — > S of C-algebras with the property that f(P(x)) = Q(f(x)) 
for all x G R. 

Let Baxc A denote the category of Baxter C-algebras of weight A. A Baxter ideal 
of (R,P) is an ideal I of R such that P(I) C I. For a Baxter ideal I of (R,P), the 
quotient Baxter C-algebra is the quotient algebra R/I, together with the C-linear 
endomorphism P : R/I — > R/I induced from P. If / : (R,P) — > (S,Q) is in Bax^A, 
then kerf is a Baxter ideal of R, and im/, with the restriction of Q, is a Baxter 
sub-C-algebra of (S,Q). 
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3 Mixable shuffle products 



This section is a preparation for the next section, where we will give a description of 
free Baxter algebras in terms of mixable shuffles. We start with a study of mixable 
shuffles in the context of permutations. We then apply this study to mixable shuffles 
of vectors. Finally we consider mixable shuffles of tensors, using the mixable shuffle 
of vectors as a "generic" form. For the purpose of providing a solid foundation for 
later applications, we give full details of the proofs, even though some of them might 
be intuitively clear to the expert. So some readers might just want to look at the 
definitions and results and move on to the next section. 



3.1 Permutation shuffles 

For m,n E N+, define the set of (to, n)-shuffles by 

S(m, n) = 



a G S, 



m+n 



a-\l) < a-\2) < ... < <r-\m), 

ct _1 (to + 1) < ct _1 (to + 2) < . . . < a' 1 (m + n) 



Equivalent ly, 

S(m, n) = 

(J G S m+n 



if 1 < a(r) < a(s) < m 

or to + 1 < a(r) < a(s) < m + n then r < s 

A pair of indices k + 1), 1 < k < m + n is called an admissible pair for an 
(m, n)-shufne a G S(m, n) if a(k) < m < a(k + 1). Denote T a for the set of admissible 
pairs for a. For a G S(m,n) and T C T°", call the pair (cr,T) a mixable (m, re- 
shuffle. When T = 0, (a, T) is identified with the shuffle a. Denote S(m, n) for the 
set of mixable (to, n)-shuffles. So 



5(771,71) = {((7,T) | a G S(m,n), T C T a }. 
s(m, n) 



Also denote 

S(m, n) | . 

Intuitively, a (to, n)-shuflle is a permutation <r of {1, . . . , to, to + 1, . . . , to + n} such 
that the order of {1, ... , to} and {to+1, . . . , m+n} are preserved in {cr(l), . . . , ct(to), cr(m+ 
1), . . . , a(m+n)}. Further, a mixable (to, n)-shuffle (a, T) is a (to, n)-shufne a in which 
pairs of indices from T represent positions where "merging" will occur. The precise 
meaning of "merging" will be described later. 

1 2 3 \ 

2 2 j is a (2, l)-shume. The pair (1,2) is an admissible 
12 3, 

is also a (2, l)-shume, but has no admissible 



For example, a = 



1 



pair for a. Note that r 
pairs. 



3 12 
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Denote 

S lfi (m,n) = {(a,T)ES(m,n)\ (1,2) £T,a-\l) = l}, 

S 0jl (m,n) = {(<r,T) G S(m,n) | (1,2) G" T,cr~ l {m + 1) = 1}, 

S 1A (m,n) = {(<J,T) G S(m,n) | (1,2) G T}. 

Then we clearly have 

S(m,n) = S lfi (m,n) (J 5 ,i(m,n) |J S 1A (m,n) (4) 

since, by the definition of S(m,n), one and only one of <x _1 (l) and a~ 1 (m + 1) equals 
1. Similarly we will define the set of (m, n, £)-shuffles by 



S(m, n, £) = 

ct-^I) < a- 1 (2) < ... < <r-\m), 
j a~ 1 (m + 1) < a~ 1 (m + 2) < . . . < a~ 1 (m + n), 

f G 5m+n+f a- 1 (m + n+ 1) < cj-\m + n + 2) 
< ... < a~ 1 (m + n + I) 

For a G S(m,n,£), define 

• • • 

= 7^i,o [J T\,o,\ [J ?o,i,i U ^1,1' 



where 

J i,i,o 
J i,o,i 
J o,i,i 

T a 
2 ixi 



+ 1) 

(A;,A; + 1) 
(fc, A; + 1) 



l<fc<m + ra + ^-l, <r(fc) < m < <r(fc + 1) 
and either <r(/c + 1) >m + n or m + n> er(A; + 2) 

l<k<m + n + £, a(k) < to, m + n < a(k + 1) 



l</c<m + n + £, o-(/c) < to + n < a(k + 1) 
and either a(k — 1) > to or to > u{k) 

1 < k < m + n + £ - 1, 



(k, k + 1, k + 2) 



cr(fc) < to < a{k + 1) 
< to + n < cr(k + 2) 



For (<r, T) G S(m,n,£), define 

deg r =| t n r^ 1 + l t n 7£ 0>1 1 + | t n r * tl \ +2 \ t n Tfa 



Denote 
and 



S(m,n,£) = {(a,T) | a G S(m,n,£),T G TJ, 
s(m,n,£) =| S(m,n,£) \ . 



(5) 
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Proposition 3.1. Let m,n G N+. 

1. s(m, n) = s(m — 1, n) + s(m, n — 1) + s(m — 1, n — 1). 
«. | {(<r,T) G S(m,n) | |r|=i}|= 

n 

3. s(m,n) = > ( )( ). 

' * n i 

i=0 

n+£ n 

. „. v — -\ v — -v . m+n+l—k . , I . „ m+n— i . . n . 

* S (m,n,f)^2 )( )( )( ). 

^ ^ £ fe— i n i 

k=0 i=Q 

Proof. 1. This follows from equation (J1J. 

2. Denote the set on the left hand side of the equation by S^\m,n). We use 
induction onm + n, with m, n > 1. When m + n = 2, this can be verified directly. In 
general, define «S/ 10 x(m, n) (i.o) (fn,n) and similarly for S^Jm, n) and 

^(ii)( m > n )- From equation (jlj), we obtain, 

S (i) (m,n) = S$J(m,n)U^3(m ) n)(j5g(m ) n) 

= S (i) (m- l,n) [J 1) [J S^im-l^n- 1). 

Here A = 5 means that the two sets A and I? have the same cardinality. By the 
inductive assumption, the size of the right hand side is 

, m+n— 1— i. , m— 1. .m+n— 1— i. , m. ,m+n— 2— i. . m— 1. 

( )( ) + ( )( ) + ( )( )• 

m— 1 i mi m— 1 8—1 

Applying Pascal's identity, we see that the last sum is also the value of ( m ^~*)( m )- 

3. This follows from part 2 by summing over % for i — 0, . . . , n. 

4. The proof is similar to the proof of part 3. For < k < n + £, denote 

T) G S^m, n, £) | deg T = k > . 
We will use induction on j = m + n + m, n, £ > 1 to prove 

n 

i , „« i x — ^,m+n+f— fc. , £ „ .m+n— i. . n. , . 

(^(m,n,f)h^( )( )( )( ). (6) 

£ fe— i n « 

i=0 
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When m + n + £ = 3, the equation can be verified directly. Now assume that the 
equation holds for m + n + t < j and consider the case when m + n + i = j . Denote 



S li0 , (m,n,£ 
So,i,o(m,n,£ 
S 0}0)1 (m,n,£ 
Si,i,o{m,n,£ 
Si >0 ,i(m,n,£ 
S 0tl)1 (m,n,£ 
S 1X x{m,n,£ 



{(a, T) E S(m, n 
{(a, T) E S(m, n 
{(a, T) E S(m, n 
{(a, T) E S(m, n 
{(a, T) E S(m, n 
{(a, T) E S(m, n 
{(a, T) E S(m, n 



a-\l) = 1,(1,2) £T}, 
t7- 1 (m+l) = l,(l,2)^T}, 
a-\m + n + l) = 1,(1,2) £ T}, 
(1,2)6^^,(1,2,3)^}, 
(1,2) em 7^ 0>1 , (1,2,3) ^T}, 
(1,2)^0^,(1,2,3)^}, 
(1,2,3) ET}. 



Also denote 



for u, v, w = or 1. It follows from the definition that 

5(m, n, £) = 

Si t o t0 {m,n,£) [J S ,i,o{m,n,£) [J S ,o,i{m,n,£) 

• • • • 

y 5i,i,o(m,n,^) y ^i^n,^) y £0,1,1(777., n,/) y s^i^m,™,^. 



So 



S {k) (m,n,£) = 

£i3,o( m > n ^) U S { Q %{m,n,£) (J S$ tl (m, n, £) 

• • • • 

U S[%( m ,n,£) U ^5,i(m,n,^) |J ^(m, n, I) |J SfJ^m.n,!) 
- 5g> (m - 1, n, *) |J ^ (m, n - M) (J S$, x (m, n, £ - 1) 
U Stfii™ ~ 1, n - 1, i) (J ^Ai V - 1, n, * - 1) 



|J 3fcV, n - 1, £ - 1) |J 5«(m - 1, n - 1, £ - 1). 

Here again = stands for a bijection between sets. Taking cardinalities and applying 
the induction hypothesis, we see that the left hand side of equation Q is 



i=0 



.m-j-n+E— k— 1 . , £ . ,m+n— i— 1. ,n, 
£ k—i n i 
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n-l 



+ E< 

i=0 
71 

+ E< 

i=0 
n-l 

+ E( 

i=0 
n 

+ E( 

i=0 
n-l 

+ E( 

i=0 
n-l 

+ E( 



i=0 



m+n+l— fc— 1 

m+n+£— fc— 1 
£-1 

m+n+£— fc— 1 
£ 

m+n+£— fc— 1 
l-l 

m+n+l—k—1 
I 

m+n+l—k—1 
£-1 



£ , .m+n— i— 1 . , n— 1 , 
fc— i ji—l i 



£— 1 , , m+n— 7—1 . , n 
fc— i n j 



I . , m+n— i— 2, 
fc— i— 1 n—1 



£— 1 m+n-t-1, 
fc— 7— 1 n 



£— 1 . .m+n— i— 1 , 
k—i—1 n—1 



l—l . .m+n— i— 2, 
k—i—2 n—1 



71-1, 



n—1 , 

7 

n-l, 



On the other hand, using Pascal's identity we see that the right hand side of the 
equation (jUJ) equals 



E' 

7=0 



. m+n+£— k . , £ , . m+n— i « , 7i « 
£ A;— i n 7 



, m+n+£— fc— 1, , I , .m+n— 7—1, . n, v — \ . m+n+l— k— 1 , , £ , . m+n— 7—1, ,n, 

• , >U< . «,) + E( « )(»_,)< ... )(,- 

7=0 7=0 



E 

7=0 

n 

E 

7=0 

? 

+E 

7= 

71 

E 

=o 

? 

+ E 

7=0 

n 

+E 

7 = 

n 

E 



m+n+l— k— 1. , £ , .m+n— 7, . n, v — -\ 

)(..)( )(.)+E 

fc— 7 n 7 ' * 



7=0 



. m+n+l— k— 1 . . £ ^,m+n— i, ,n, 
£—1 fc— i n i 



m+n+t— k— 1. ,£—1, ,m+n— i— 1, ,n, v — -\ , m+7«+£— fc— 1 , . £—1 , .m+n— i, ,n, 

)( )( )( ) + 2J )( )( )( ) 

£—1 fc— 7 n 7 £—1 fc— 7— 1 n i 

7=0 7=0 

n n 

, m+n+l— fc— 1 , , £ , , m+n— 7—1, . n, v — . m+n+£— fc— 1 . , £ , .m+n— 7—1, .n—1. 

• , >U< . )( , ) + E< , )(J( )(,) 

7=0 7=0 

n 

... « . . m+n— t—l ,. n—1 , ^ — -\ 

><,J+E< 



7=0 



m+n+l—k—1 £ m+n—i—l n—1, 



7=0 

n 



m+n+l—k—1 l—l m+n—i—1 n , 
£—1 fc— i n 7 



. m+n+£— fc— 1, , £—1 , , m+n— 7—1, ,n, v — ■> 

)(, . .)( 



?-l fc-7-1 



m+n+£— fc— 1 £—1 . .m+n— i— 1 . , n, 
7 ' ' £—1 k—i—1 n—1 i 

7=0 



n n 

, m+n+£— fc— 1, . £ , .m+n—i—1. ,n, v — ■> . m+n+£— fc— 1 , . £ , .m+n—i—1, . n— 1, 

• « >U ( . )( , ) + E( , )(,_,)( )( , ) 

=0 7=0 



S 



E. in+n+i—K— 1 . , l . .m+n—i—l. . n— 1, v — -\ . m+n+£— ft— 1 . ,z—l s .m+n—i—L. ,n. 

( )( )( )( + 23 )( )( )( ) 

£ fc-i n-1 i-1 z — ' l-l k-i n i 

i=0 i=0 

n n 

E.m+n+£— k— 1. , £— 1 „ ,m+n— i— 1. ,n. v — \ . m+n+i— k— 1 . . £—1 . . m+n— i— 1. ,n— 1. 
< . ><,)+£< f— i )(,) 

i=0 i=0 
n 

E. m+n+£— fc— 1. , £—1 . ,m+n— i— 1. . n— 1, 
( >W< „-, >(„> 

i=0 

n n— 1 

E,m+n+£— k— 1. , € . , m+n—i—l. ,n. v — \ , m+n+£— A;— 1 . . ^ . ,m+n— i— 1» .n— 1. 
( « >U< . )( , ) + £ ( « )( ,-, )( »<,) 

i=0 i=0 

n— 1 n 

E. m+n+£— k— 1 . . ^ . ,m+n— i— 2« . n— 1. v — -\ , m+n+£— ft— 1 . ,1—1. , m+n—i—l. ,n. 
< , )( t _J( >(,>+£( )(»_,)( „ )(,) 

i=0 i=0 

n n— 1 

E. m+n+£— k— 1. , £—1 . .m+n—i—l. ,n. v — \ . m+n+l— k— 1 . , £— 1 . .m+n—i—l. ,n— 1« 
( <- >W< „ )( ,)+£( M >W 'U 

i=0 i=0 
n-1 

E f m+n+i— k— 1« , 1 „ , m+n— i— 2. . n— 1. 
( )(_,)( )( , )• 



£-1 fc-i-2 n-1 

i=0 



This completes the induction, proving equation (jSJ). Then the equation in part 4 is 
obtained by summing equation (jBJ) for k = 0, . . . , n + £. ■ 



3.2 Mixable shuffles of vectors 

Let Q be a countable infinite set. Let Q be the set consisting of finite non-empty subsets 
of Q. For F — (Fx,..., F m ) efl m ,ff6 S m and T a subset of {(£,£ + 1) | 1 < / < m}, 
denote 

a(F; T) = (F^ F^ m )) 

where, for each 1 < £ < m, 

F . F _ / U if (l,f+l)GT 

ctW ' CT(m) 1 otherwise 

For example, let x x ,x 2 ,y,z G O, i*\ = {xi,x 2 } F 2 = {?/}, F 3 = {2}, a = 
(213) eS 3 ,T = {(2,3)}. Then 

a(F) = (F a (i),F a (2),F a (3)) = (F 2 ,F 1 ,F 3 ) 

and 

a(F;T) = (F 2 F\ F 3 ) 
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= (F 2 ,F 1 UF 3 ) 

= ({y},{ x u x 2,z}) eh 2 



LetF=(F u ...,F m )en m , G = (G u ...,G n )en n . Denote (F, G) = (F u . . . , F m , G 1 , . . . , G n ) G 
Qm+n and denote 

v f F k , l<k<m 

\ Gk-m, m + l<k<m + n 

Definition 3.2. 1. For a G S(m, n), F ett m ,G G tt n , 

a(F,G) = (U a{1) ,...,U a{m+n) ) eh m+n 
is called a shuffle of F and G. 

2. Denote 

S(F,G) = {a(F,G) | a G S(m,n)} 
for the set of shuffles of F and G. 

3. Let a G S(m,n) and let T be a subset ofT a . The element 

G);T) = (C/ CT (i) ; U a (2) U a ( m+n )) 
is called a mixable shuffle of F and G. 
4- Denote 

S(F, G) = {<j((F, G);T) I a G S(m, n), T C %}. 
for the set of mixable shuffles of F and G. 
If we further have H = (Hi, . . . , He) G Cl e , then we denote 

S(S(F,G),H)= |J S(U,H), 

U£S(F,G) 

and 

S(F,S(G,H))= y 

UeS(G,H) 

For a G S(m, n, t) and T G T a , define a mixable shuffle of F, G and H by 
a((F, G, H);T) = (£/ CT (i) U a ( 2 ) U^ m+n+ e)) , 

in which 

( F k , l<k<m, 
Uk — \ Gk-m, m+l<k<m+n 

[ H k - m _ n , m + n+1 <k<m + n + £. 
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Then for l<k<m + n + £ — 1 , we have 



U a (k) , Ucr^k+l) l Kr(fc+2) 



U a(k) U U a(k+1) U U a{k+2) {k, k + 1, k + 2) G T 

U U a{k+1) ,U a{k+2) (k, k + 1, fc + 2) £ T, (fc, fc + 1) G T 

C^r(fc), ^(fc+i) U £^^+2) (M + l,H2)^T,(Hl,H2)eT 

, U a{k+1) , U a{k+2 ) otherwise 

Denote 

S{F, G, H) = {a{{F, G,H);T)\ae S{m, n, £), T C T ff }. 
for the set of mixable shuffles of F, G and H. 

Proposition 3.3. Let Xi, . . . , X m , Y±, . . . , Y n , Z±, . . . , Z# be disjoint subsets of Q. Let 
X = (Xi) G n m , Y = (Y-) en n , Z = (Z k ) G n e . Then 

1. | S(X,Y) |= s(m,n). 

2. S{X,Y) = S(Y,X). 

3. S(S(X, Y),Z) = 3{X, Y, Z) = S(X, S{Y, Z)). 

Proof. 1. We use induction for m + n with m, n > 1. When m = 1, S(X, Y) contains 
the vectors 

(Xi, Yi, . . . , Y n ), (Yx, . . . , Yi-i, Xi, Yi, . . . , Y n ),i — 2, . . . , n, (Yi, . . . , Y n , Xi) 

and 

(Xi U Yx, Y 2 , . . . , Y„), (Y x , . . . , Y_ 1? X 1 U Y, Y m , . . . , Y n ),i = 2, . . . , n - 1, 
(Yi, . . . , Y„_i, Xi U Y„). 

So there are at least 2n + 1 elements in 5*(X, Y). By construction, the set S(X, Y) has 
no more elements than S(m,n) which is s(l,n) = In + 1 by Proposition IrS.ll So the 
claim holds in this case. A similar argument verifies the claim in the case when n = 1. 

Now assume that the claim is true for m + n < k with m,n > 1, and consider the 
case when m + n = k. Since m, n > 2, it makes sense to define 

S Xl (X,Y) = {(X 1 ,S)\SeS(X 2 ,...X m ,Y)}, 
S Yl (X,Y) = {(Y U S) \ SeS(X,Y 2 ,...,Y n )}, 
S XlUYl (X,Y) = {(X 1 UY 1 ,S)\S ES(X 2 ,...,X m ,Y 2 ,...,Y n )}. 

By assumption, X 1; Y\ and Xi U Yx are distinct. So we have 

S(X, Y) D £ Xl (X, Y) |J S Yl (X, Y) |J S XlUYl (X, Y) (7) 
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By induction hypothesis, the three sets on the right have cardinalities s(m— 1, n), s(m, n— 
1) and s(m — 1, n — 1). So 

| S(X, Y) |> s(m — 1, n) + s(m, n — 1) + s(m — 1, n — 1) 

which is s(m,n) by equation (jjj). So by Proposition 13.11 | S(X, Y) \= s(m,n). Then 
from equation we have 

S(X,Y) = S Xl (X,Y) |J |J S XlUYl (X,Y). (8) 

2. We prove by induction onm + n, m,n > 1. The statement can be directly 
verified for m + n < 2. Assume that it is true for m + n < k and let X G A m , Y £ A n 
with m + n = k. Then 

= {(X 1 ,5)|Se5((l 2 ,...,X m ),7)} 
= {(Xx,^ |5G5(F,(X 2 ,...,X m ))} 
= S Xl (Y,X) 

and similarly, 

S yi (X,F) = S Yl (Y,X) and ^un^F) = ^ux.^X). 
Then it follows from Equation (jHj) and its symmetric form for S(Y, X) that S(X, Y) = 

s(x,x). 

3. We first prove that 

S(S(X,Y),Z)= U 

c/e5(x,y) 

is a disjoint union. By assumption, Xi, Yj and Zk are disjoint subsets of Q. Since each 
component of U = (U r ) G S(X, Y) is either a Xi, or Y} or a Xi U it follows that the 
subset U r and Z k are also disjoint. Let U = (Ui, . . . , f/ r ) and U' = (U[, . . . , U' s ) be two 
distinct mixable shuffles of X and K. Ifr^s, then without loss of generality, we could 
assume that r > s. Then there is a U ro that is different from any Z7j and therefore is 
disjoint with any C/j. Thus C/ ro is disjoint with any component of any W G S(U',Z). 
On the other hand, U ro has non-trivial intersection with some component of every 
W G S(U, S). Therefore, S(U, Z) n 5(C/', Z) = 0. 

Now assume that r — s. We use induction on r. For r = 1, U ^ U' means U\ 
is different from any components of U' . So U\ is disjoint from any components of 
W G S(U', S), and the claim is proved. Assume that the claim is true for r, and let U 
and U' both have length r + 1. Suppose S(U, Z) PI £(£/"', Z) is not empty. Then there is 
a W G 5(17, Z) n S{U', Z). Write = (W u • • • , W*), then VFi = U x or Z a or U x U Z x . 
If Wi = U\, then since Lq is disjoint from any Z e , from W G 5(^7', Z) we get W\ = U[. 
This shows that 
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(w 2 , ...,w k )e S((u 2 , ...,u r ),z)n S((u>, ...,u' r ),z). 

But since U x = W x = U[, from U ^ U' we get (U 2 , ■ ■ ■ , U r ) ^ [U' 2) . . . , U' r ). Then by 
induction assumption, 

S((u 2 ,...,u r ),z)nsm,...,u^),z) = ( p. 

This is a contradiction. For the same reason, W\ = Z\ or U\ U Z\ also implies contra- 
diction. Therefore, the claim is true for r + 1. This proves that 

S(S(X,Y),Z) = U 

U&S{X,Y) 

By Proposition HH S(X, K) has (™)( m+ ™ _i ) mixable shuffles of length i, and for 
each such mixable shuffle U, the cardinality of S(U, Z) is 



Therefore S(S(X,Y),Z) has 

n £ 

v ■> v > , m+n—i , . n . , m+n+^— i— j , , £ , 

i=0 i=0 J 
n+i n 

\ ^ \ -\ . m+n+l— k , , 4? . . m+ri— i. ,ti. 
Z-~l Z-~l ^ a „ 



k—i n 
k=0 i=0 



elements. But S(S(X, Y), Z) is a subset of S(X, Y,Z), and by Proposition 13.11 the 
second set has cardinality bounded by the same number. This proves the first equality 
in part 3 of the proposition. 

To prove the second equality in part 3, define 

S(X, Y) Xm = {(S, X m ) | S e S(X U . . . , X m _x, Y)} 

and similarly for S(X,Y)y n , S(X,Y)x m uY n - Then the same argument as above gives 

S(X, Y) = S(X, Y) Xm |J S(X, Y) Yn |J S(X, Y) XlUYl (9) 
The rest of the proof is similar. ■ 
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3.3 Mixable shuffles of tensors 



Notation: For the rest of this paper, let A be a fixed element of C. For any C-modules 
M and N, the tensor product M ® N is taken over C unless otherwise indicated. Let 
A be a C-algebra. For nGN, denote 

= A® . . . <g> A 



n factors 

with the convention that A m = C . 

Let A be a C-algebra. For m,n,£ G N+, denote x = xi <8> . . . <g> x m G A® 
y = yi <g> . . . <S> y n e -4® n and 2; = <g) . . . ® ^ G A®^. For a G S n , denote 

a(x) = x a{1) <S> x a (2) <S> ■ ■ ■ <S> x CT(n ). 
Denote x <g> y = X\ ® . . . x m ® yi . . . (8) y n G ^4®( m + n ). and, for cr G S m+m denote 

a (x ®y) = W ff( i) (g) U ff(2 ) ® • • • ® U^m+n), 

where 

J Xk, 1 < k < m, 
Uk \ Vk-mi m + l<k<m + n. 

Likewise, for a G S' m+n+ ^, denote 

cr(x (g) y <g> z) = u a{1) <g> u ff(2) <g> . . . <g> n CT(m+n+ ^, 

in which 

{Xfc, 1 < k < m, 

Uk~m, m+l<k<m + n 
Zk-m-n, m + n + 1 <fc<m + ra + £. 

Definition 3.4. Let x G A® m ; y G A® n and a G S(m,n). 

1. a(x <S> y) G A®( m + n ) caZ/ec? a shuffle o/x and y. 

2. Let T be a subset ofT a . The element 

a(x ®y,T) = z a(1) (g)z a{2) (g) . . . ®z a{m+n) , 
where for each pair (k, k + 1), 1 < k < m + n, 

y A r _ / z*(k)Mk+V, (k,k+ 1) G T 

z^m^k+i) - 1 z ^ ^ (fc+i)? fc + 1} g T 

ca//ed a mixable shuffle o/rc and y. 
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in 



It follows from the universal property of the tensor product A® k that a(x <g> y; T) 
does not depend on the choice of x%, . . . , x m , y%, . . . , y n representing the tensor x <S> y. 
Now fix A G C. Define, for x and y as above, 

xo + y = \ m <y{x®y\T)e A® k . 

(a,T)eS(m,n) k<m+n 

The operation o + extends to a mapping 

o+ -,A® m xA® n ^ A® k ,m,neN 

k<m+n 

by C-linearity. Let 

W+(A) = W+(A, A) = A® k = C © A © A® 2 © . . . . 

fceN 

Extending by additivity, the binary operation o + gives a C-bilinear map 

o+ : m+(A) x in+(A) - ni+(A) 

with the convention that 

C x ^4® m ► 

is the scalar multiplication. This binary operation is called the mixable shuffle 
product of weight A. 

Theorem 3.5. The mixable shuffle product o + defines an associative, commutative 
binary operation on UI^(A) = ® fceN A® k , making it into a C-algebra with the identity 
■\r( C .1 

UIq(A) will be called the mixable shuffle algebra (of weight A) on A. In the 

special case when A = 0, UIq(A) is denoted by Sh(A) in [TTIj . 

Proof. We only need to verify the commutativity and associativity of the operation 
o + . For this we make use of the mixable shuffle of vectors discussed in the previous 
section. 

Recall that Q is an infinite set, and Q is the set of finite non-empty subsets of Q. 
For F = (Fx,..., F m ) eO m ,d6 S m and T a subset of {(£,£ + 1) | 1 < £ < m}, denote 

a(F; T) = (F a (x) F a {m)) 

where, for each 1 < £ < m, 

F * p _ / F a(i) U F a{l+1) if (£,£+!) ET 
a(l) ' 1 F aii) ,F a{e+1) otherwise 
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Given a (p : U™ 1 F« — > A, denote 

<f{a{F] T)) = (a CT (i)d) . . . ®a a{rn) ) 

where, for each 1 < £ < m, 

a °{t) = n 

For example, suppose Xx,X 2 ,Y, Z G f2 are distinct. Let F x = {Xi,X 2 }, F 2 = 

{Y}, F 3 = {Z}, a = ^ \ 3 ) e S ^ T = i ( 2 ' 3 )i- If f sends each of the 
capital letters to the corresponding lower case letter in the polynomial algebra A = 
C[xi,x 2 , y, z], then 

<p(<r(F;T)) = V (F a{1) ,F a{2) UF a{3) ) 
= y <g> x x x 2 z G A m . 

For any fixed m,n,£ and fixed x G A® m , y G A® n and z G y4® £ , choose distinct 
elements X±, . . . , X m , Yx, . . . , y n , Zx, . . . ,Zg from the infinite set O. Also use the same 
letters for the singletons {Xi}, {Yj} and {Zk} of fl. Let y> be the map sending Xi to 
Xi, Yj to and Z^ to Then o + (of weight A) could be described as 

x o + y = (p(X) o+ ip(Y) = A dog(6r V(^), 

ueS(x,Y) 

where deg(U) —\ T | if U is given by (cr, T) G S(m, n). The commutativity of o + follows 
from Proposition 13.31 2. 

Let a mixable shuffle W G S(X,Y,Z) be given by (a, T) G S(m,n,£), i.e., W = 
cr((X, Y, Z); T). Define deg(W) = degT, with degT defined in equation By 
Prop osit ion 13 . 31 3 . W could also be obtained from a mixable shuffle V in S(U, Z), given 
by ((Jx,Tx), where U is from a mixable shuffle in S(X, Y), given by (a 2 ,T 2 ). Thus we 
have V = ax((U,Z);Tx) and U = a 2 ((X,Y);T 2 ). It follows from the definition of W 
and deg W that the length of the vector W is m + n + £ — deg(W). Since W is also 
given by V = <Ti((U, Z)\ Tx), its length is also given by 

length of U + £ - deg(V) = m + n - deg(U) + £- deg(V). 

Thus we have deg(W) = deg(U) + deg(V). Then it follows from Proposition ^. 31 3 that 

(xo+y)o + z = ( A dcg(C7 V(£0) o+ z 

ueS(x,Y) 

UeS(X,Y) 
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= £ A dcg(C7) J- A dCg ( y V(^) 

ueS(x,Y) veS(u,z) 

j2 A dcg(i/)+dcg(i/ v(w) 

weS(S(x,y),z) 
weS(x,Y,z) 

We similarly have 

x o+ (y o + z) = J2 A dcg(H/ V(^)- 
weS(x,Y,z) 

This proves the associativity. ■ 



4 The free Baxter algebra 

We now use the mixable shuffle product from last section to describe the free Baxter 
algebras. We will first construct the free Baxter algebra on a C-algebra A. We will 
then give constructions of other types of free Baxter algebras. 

4.1 The basic free construction 

With the same notations as those in last section, we define m c (A) to be the tensor 
product algebra A © c H1q(A). Thus 



U1 C (A) = A® {k+1) = A © A® 2 © . . . 



fceN 

as A-modules and the product on UIc(A) is defined by the augmented mixable 
shuffle product (of weight A) 

(x © xi © ■ ■ ■ © x m ) o (y © yi © . . . © y n ) 
= XoVo ® ((xi © ... © x m ) o + (yi © . . . © y n )) 

= XoUo® ^2 A |T| o-((a;i © x x © . . . © x m ) © (y x © . . . © y n ); T) 

(a,T)eS(m,n) 

Define a C-linear endomorphism P4 on UIc(A) by assigning 

Pa(x © Xi © ... © x n ) = 1^ © :r © X\ © . . . © x n , 

for all a; © x x © . . . © x n G A® (ri+1) and extending by additivity. Let j A : A — > IHc(^) 
be the canonical inclusion map. 
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Theorem 4.1. For any C-algebra A, (iHo(A), Pa), together with the natural embed- 
ding j A : A ^ UIc{A), is a free Baxter C-algebra on A (of weight X) in the sense that 
the triple (lHc(A), Pa, Ja) satisfies the following universal property: For any Baxter 
C-algebra (R,P) and any C-algebra map <p : A — > R, there exists a unique Baxter 
C-algebra homomorphism <p : (illc (A) , Pa) — > (R,P) such that the diagram 

A— — — ^Ulc(A) 



commutes. 

Remark: By the same argument used to show the uniqueness of other "universal" 
objects, Hie {A) is the unique free Baxter C-algebra on A up to isomorphism. The 
Baxter C-algebra HLc(A) will be called the free Baxter C-algebra (of weight A) 
on A. 

Proof: We first show that P A is a Baxter operator on HIc(A). For this we only need 
to verify that for any x, y G m c (A), 

P A (x) o P A {y) = P A (x o P A {y)) + P A (y o P A (x)) + \P A {x o y). 

By additivity, we only need to verify this equation for any x = x\ ® . . . ® x m G A® m 
and y = y% <8> . . . <g) y n G A® n . By definition, 

P A (x)oP A (y) = (1a® a*® . ..®a; m )o(lA®2/i® •••®2/n) 
= 1a® \ m o-(x®y;T) 

(<j,T)£S(m,n) 

= P A ( ^(x^T)). 

(cr,T)eS(m,n) 

Recall that equation (jlj gives us 

S(m,n) = § 1>0 (m,n) (J S 0)1 (m,n) [J S 1A (m,n) 

where 

Si,o(m,n) = {(o~,T) G S(m, n) | (1,2) ^ T, c -1 (l) = 1}, 
S ,i(m,n) = {(a, T) G S(m, n) | (1, 2) G" T, a~ l (m + 1) = 1}, 
S 1A (m,n) = {(cr, T) G S(m,n) | (1,2) G T}. 

Further, 

^ A |T| a(x <g> y) = A |T| a((x 2 ®...®x m )(g)y;T) 

(o-,T)eSi,o(m,n) (er,T)6S(m-l,n) 
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= x ❖ (1a <S> yi ® ■ ■ ■ <8> y n ; T) 
= xoP A (y). 

where for m = 1, a((x2 ® . . . <8> x m ) ®y;T) = a(y) (only T = is possible). Similarly, 

^ A^z^T) = P A {x)oy. 

(a,T)eSo,i{m,n) 

and 

^ A |T V(x<g)y;T) 

= ^ ® ^ Tl v((x 2 ® ■ ■ ■ ® X m _i) ® (y 2 ® • • • ® 2/n-i);T) 

(cr,T)e50-l,n-l) 

Therefore, 

Pa(z) o Pa (2/) = Pa(x o Pa(2/)) + Pa(Pa(x) ot/) + APa(x o y) 

This shows that P A is a Baxter operator of weight A on Ulc(A), making it into a Baxter 
C-algebra. 

Before verifying that (uic(A), P A ) satisfies the universal property of a free Baxter 
C-algebra over A, we need some preparations. Let (R, P) be a Baxter C-algebra. For 
x, y G R, denote P x (y) = P{xy). For x <g> . . . (g> xt G _R®( fc+1 ) ) denote 

aro(oJ =1 P ap )(y) = x (P xl o • • • o P s J(j,), (10) 

with the convention that o° =1 P a . r = id^. It follows from the universal property of 
the tensor product _R®( fc+1 ) and the C-linearity of the Baxter operator P that the 
right hand side of equation (fT0|) is well-defined, and does not depend on the choice of 
Xo, . . . , Xk G R representing the tensor x ® ■ ■ ■ ® %k- F° r E S n) denote 

°{o^P Xr ) = o^P x<ry 

For m, n G N+, denote 

(o m P )o(o n P ) — o m+n P 

\ u r=l r Xr) u \ u s=l r ys) — u t=l r zti 

where 

J Xk, 1 < k < m, 

\ Vk-m, m + l<k<m + n 

and, for (<r, T) G S(m,n), denote 

<x((°™ =1 P, r ) o (o^Pj; T) = P zl o ... 6P Zm+n 
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where for each (fc, k + 1), 1 < k < m + n, 

P 6P f P ZkW (k,k + l)eT 

Zk Zk+1 I P Zk oP Zk+1 , (k,k + l)*T 
Proposition 4.2. For m, n E N and x = rr <8> • • • <8> x m e P®( m+1 ) , y = y$® . . .®y n E 

_^><g>(n+l) 

(xo(o™ 1 P Xp )(l Jl ))(ito(o^ 1 P I ,.)(l Jl )) 

= x y £ APVao^P,,.) o (oy =1 P w .))(l fl ). 

(«r,T)6S(m,Ti) 

Proof. It is clear that the equation in general follows from the case when xq — yo — 1, 
in which case the equation is 

(o™ 1 Px P (iH))(oy =1 p w .(i fl )) = X; A" r "<7((o™ 1 p Xp ) o (oy =1 p w .))(i fl ) 

(a,T)eS(m,n) 

with to, n > 1. 

For this we prove by induction on k = m + n. If k — 2, then m — n — 1. The 
equation to be proved in this case is 

P(xi)P(j/i) = P(xiP(j/i)) + P(yiP(x!)) + AP(x 1 y 1 ) 

which is part of the definition of P. Assuming that the equation holds for all x E 

jR ®(m+i) j y e ^(n+i) W ith m + n < k. Let a; G J R»( m + 1 ), y e p®("+i) with m + n = k. 
Then 

(o™ 1 P jep )(l Jl )(o^ 1 P tf .)(l Jl ) 
= P(x 1 (o^ 3 P SBp )(l Jl ))P(y 1 (o^P tf J(l Jl )) 
= P(x 1 (o^ ! P SBp )(l Jl )P(y 1 (o^P I ,.)(l Jl ))) 

+P(yi(o^ 2 P s ,)(ii i )P(x 1 (o^ ! p sep )(i Jl ))) 

+AP(x 1 (o^ 2 P Xp )(l Jl )y 1 (o^P w .)(l JI )) 
= P(x 1 ( ™ ! P SBp )(l Jl )(oy =1 P w .)(l JI )) + P((o™ 1 P sep )(l fl )y 1 (o^P Wr )(l Jl )) 

+AP(x 1 y 1 (o^ ! P Xp )(l Jl )(o^P w .)(l JI )) 
= P(xi £ Al T la((o™ 2 P^)o(o: =1 P ?/ J)(l i? )) 

(a,T)eS(m-l,n) 

+P( yi Yl * lT] °(K =1 P Xr ) o (o^P w .))(l fl )) 
(a,T)eS(m,n-l) 

+\P{x m Yl ^((o™ 2 P Xr ) o (o^P tf J)(l fl )) 

(«r,T)eS(m-l,7i-l) 
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(by induction) 

{<7,T)eS(m-l,n) 

+ £ A'^ o a((o^ 1 P a . r ) o (o: =2 P 2/s ))(l ii ) 

(ff,T)6S(m,n-l) 

+A E Al r 'P xm o ff ((o™ 2 P Sr ) o (o? =1 P tf J)(l fl ) 

(<r,T)e5(m-l,n-l) 

E Al r la((o- 1 P^)o(o^ =1 P 2/ J)(l i? ) 

(<7,T)eSi,o(m,n) 

+ E A m *((C=i^J°(°?=A))(l*) 

(CT,T)e5o,i(m,n) 
+ E A'M(°?=i^)°(°?=A))(l«) 

(o-,T)g5i,i(m,n) 
cr£S(m,n) 

This completes the proof of Proposition 14.21 ■ 

We now continue with the proof of Theorem 14.11 and verify the universal property 
for HIc(v4). For a given Baxter C-algebra A and a C-algebra map (p : A — > R, we 
extend (p to an Baxter C-algebra homomorphism <p : IIIc(v4) — > {R,i) as follows. For 

any x = x <8> £1 g) . . . g) G A®^ -1-1 ) , define 

£(x) = ^(x )(o J fc =1 P (p(:Ej ))(l iJ ). 

This is a well-defined C-linear map, hence extends uniquely by additivity to a C-module 
homomorphism 

<p : m c (A) = 0#( H1 ) ^R. 

ken 

It follows from the definition of the operation o and Proposition 14.21 that <p~ preserves 
multiplication. Since (p is C-linear, and 

<p(Pa(x ® . . . ® x k )) = <p(1a <8> %o ® • • • ® 

= (°-=o^))(lfi) 

= P(^(x )(o) =1 P^. ) )(l ii )) 

= P(ip(x <8> . . . <g> x fc )), 

y5 is a homomorphism of Baxter C-algebras. It is clear from its construction that (p is 
the unique homomorphism of Baxter C-algebras extending ip. This proves Theorem l4.ll 
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Let Alg c be the category of C-algebras and let Uc '■ Baxc — > Alg c be the forgetful 
functor. 

Corollary 4.3. 1. The assignment A \— > UIc(A) gives a functor 

in c : Alg c -> Baxc, 

where for a C-algebra homomorphism f : A — ► B , JHc(f) : IIIc(A) — »■ IHc(-B) is 
denned 6?/ ® n mf® n with f® n : A® n -> 

IIIc ^ e ^ e /^ adjoint functor of the forgetful functor Uc- 

3. Any Baxter C-algebra is isomorphic to a quotient of (U1 C (A), Pa) for some C- 
algebra A. 

By Corollary 14.31 the study of Baxter C-algebras is reduced to studying quotients 
of free Baxter C-algebras. 

From the proof of the theorem and Proposition 14. 2\ we also obtain 

Corollary 4.4. 1. For any sub-C -algebra B of a Baxter C-algebra (R, P), the Bax- 
ter sub-C-algebra of R generated by B is generated by 

{x (o r i=1 P Xi )(l R ) | x t e B, < i < r, r E N} 

as an additive group. 

2. For any C-algebra homomorphism ip : A —> R, the image of (p : IHc(A) ^ R is 
the Baxter subalgebra of (R, P) generated by <p(A). 

Proof. Let B be the Baxter sub-C-algebra of (R, P) generated by B. Denote 

S = {x (o r i=1 P Xi )(l R ) | x t e B, < i < r, r G N} 

and let B' be the additive subgroup of R generated by S. Since S is closed under scalar 
multiplication by C and the operator P, B' is a C-module and is closed under P. By 
Proposition 14. 2\ B' is closed under multiplication. Therefore, B' is a sub-C-algebra 
of R, hence contains B. On the other hand, since B contains B and is closed under 
multiplication and Baxter operator, it must contain S. Then B must contain B' by 
closure under addition. This proves the first statement. 

By its construction, the Baxter C-algebra \H C (A) is generated by Ja(A). Since (p is 
an Baxter C-algebra homomorphism, (p(jRc{A)) is also an Baxter C-subalgebra, and 
is generated by (pij^A)) = <p>(A). ■ 
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4.2 Other free constructions 



The construction of the free Baxter C-algebra UIc(A) on a C-algebra A in the last 
part could be combined with other free constructions to obtain free Baxter algebras 
on other structures. We now discuss the free Baxter algebra on a set. The free Baxter 
algebras on a monoid or on a C-module will also be considered. 

For a given set X, let C[X] be the polynomial C-algebra on X with the natural 
embedding X ^ C[X). Let (m c (X), P x ) be the Baxter C-algebra (m c (C[X]), P C [x))- 

Proposition 4.5. (uic{X), Px), together with the set embedding 

j x :X^ C[X) ^3 U1 C (C[X}), 

is a free Baxter C-algebra on the set X , described by the following universal property: 
For any Baxter C-algebra (R,P) over C and any set map ip : X — > R, there exists 
a unique Baxter C-algebra homomorphism (p : (m c (X), P x ) — > (R, i) such that the 
diagram 




Remark: When A = —1, it is easy to show that HIc(X) is closely related to the free 
Baxter algebra constructed by Cartierj2]. See section for detail. 



Proof. We only need to verify that (F C (X), P x ) satisfies the universal property of a 
free Baxter C-algebra on X. Fix a given Baxter algebra (R, P) over C, and a set map 
ip : X — > R. By the universal property of the C-algebra C[X], ip extends uniquely by 
multiplicity and C-linearity to a C-algebra homomorphism ip : C[X] — > R. Then by 
Theorem 14.11 ip extends uniquely to a homomorphism of Baxter C-algebras 

(p : (m c (X),P x ) = (m c (C[X]),P c[x] ) - (R,i). 

This proves the proposition. ■ 

Note that the free Baxter C-algebra on a set X can be described as the composite 
of two free constructions. First we construct the free C-algebra C[X] on X, and 
then we construct the free Baxter C-algebra on C[X]. In a similar manner, we can 
construct the free Baxter C-algebra on a monoid M by first constructing the free C- 
algebra C <M> on M and then the free Baxter C-algebra on C <M>. Here, for 
a given commutative monoid M, C <M> is the free C-module (Bx^mCx, where the 
multiplication on C <M> is induced by the multiplication on M. 

Likewise, we can construct the free Baxter C-algebra on a C-module N by first 
constructing the tensor C-algebra Tc(N) on N and then the free Baxter C-algebra on 
T C (N). 
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5 Relation to constructions of Cartier and Rota 



We now give an explicit description of the relation between our construction of free 
Baxter algebras and the constructions of Cartier [2] and Rota [TH] . 

As before, let C be a commutative ring with identity. Let Alg^ be the category 
in which the objects are C-algebras not necessarily having an identity and the mor- 
phisms preserve the addition and the multiplication, but do not necessarily preserve 
the identity Rota and Cartier considered the category Bax^ whose objects are pairs 
(R, P) where R is an object in Alg^ and P is a Baxter operator (of weight —1), and 
whose morphisms are morphisms in Alg^. that commute with the Baxter operators. 

For any non-empty set X, the existence of a free Baxter algebra on X in either 
of the two categories Bax^ or Bax^, can be proved by general results from universal 
algebra [HIE]- Rota and Cartier have given explicit descriptions of the free Baxter 
algebra in Bax^. Below we will focus on the relation of our construction of free Baxter 
algebras in Baxc with Cartier's construction of free Baxter algebras in Bax^. This 
will also explain the relation with Rota's construction of free Baxter algebras in Bax^., 
since, by the uniqueness of universal objects in Bax^, the free Baxter algebras of 
Cartier are isomorphic to the free Baxter algebras of Rota. 

We first recall the free Baxter algebra in Bax^ constructed by Cartier. Let M be 
the free commutative semigroup with identity on X. Let X denote the set of symbols 
of the form 

Uq ■ [ ] , m G M, u 7^ 1, and 
u ■ [ui, . . .,u m ] , m>l,u ,ui,...,u m eM,u m ^l. 

Let 23 (X) be the free C-module on X. Cartier gave a C-bilinear multiplication o c on 
23 (X) by defining 

(u • [ ])o c {vo ■ [ ]) = u v -[], 
(u • [ ]) O c (v ■ [vi, . . . , v n ]) = (v - [vx, . . . , v n ]) o c (u ■ [ ]) 

= u v ■ [vi, . . .,v n ), 

and 

Oo • [«i, • . • , u m )) o c Oq • [vt, ...,v n )) 

(-l) k+p+g Uov -<5> k ^ Q {[u 1 ,...,u m \,[v 1 ...,v n \). 

{k,P,Q)£S c (m,n) 

Here S c (m,n) is the set of triples (k,P,Q) in which k is an integer between 1 and 
m + n, P and Q are ordered subsets of {1, . . . , k} with the natural ordering such 
that P U Q = {1, . . . , k}, | P | = m and \ Q\ = n. For each (k, P, Q) e S c (m, n), 
$fc,p,Q(K, ■ ■ ■ ,u m ],[vi, ■ ■ ■ ,v n ]) is the element [ci, . . . , c k ) in X defined by 
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{a a , if j is the a — th element in P, j G" Q; 
bp, if j is the /5 — th element in Q, j P; 
a a bp, if j is the a — th element in P 
and the /3 — th element in Q 

Define a C-linear operator P% on 25 (X) by 

P£(«o-[]) = i-M, 

P^(M '[til,...,t( ro ]) = l-[tto,Ui,...,U m ]. 

Cartier proved that the pair (25(X), P£) is a free Baxter algebra on X in the category 
Bax^ 

On the other hand, since C[X] is a free C-module on M, it follows from our 
construction of the mixable shuffle product algebra HIc(X) = HIc(C[X]) that HIc(X) 
is a free C-module on the set X of tensors 

Uq <S> ■ ■ ■ Uk, Uk G M, /c > 0. 

The Baxter operator Px on Hlc(X) is defined by 

Px(u ® . . . <g> U m ) = 1 ® M ® . . . <E> M m . 

We define a map / : X — > X by 

/("o • [ ]) = u ; 
f(u ■ [ui, . . . ,u m }) = u <S> ui <S> ■ ■ ■ <S> u m , 

and extend it by C-linearity to a C-linear map 

Proposition 5.1. / is an injective morphism in Bax^., identifying 55 (X) with the 
sub-Baxter algebra ofUlc(X) with the C -basis 

{u <S> ■ ■ ■ u m | m > 0, Ui G M, i — 0, . . . , m, u m ^ 1}. 

Proof. Define h : X — > 25 (X) by = x • [ ]. Cartier showed that the pair (25 (X), h) 
is a free Baxter algebra on X in the category Bslx° c . Define g : X UI C (X) by 
g(x) = x £ UI C (X), x G X, and regard III^X) as an element in Bax^(X). By the 
universal property of 25 (X), there is a unique morphism 

~g : 25(X) - III C (X) 

in Bax^ such that g(x- []) — x. It follows from the fact that g preserves multiplication 
that g(u ■ []) = uq for uq G M, u ^ 1. 
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It remains to prove, for m > 1, uq, . . . , u m G M, u m ^ 1, that 

g(u ■ [Ux, U m }) = Uq <g> U X ® . . . ® U m . (11) 

We will prove this by induction. Let m — 1. If Uq = 1, we have 

g(u • [ui]) = g( p x(u>i)) = Px(9M) = 1 ® u v 

If uq ^ 1, we have 

g(u -[ui\) = <7((uo-[])o c (l-[ui])) 
= ^(«o-[])o^(l-N) 

= ?K'[])«»'[])) 

= M oPx-(^(ui • [ ])) 

= w oP x (wi) 
= w o(l®wi) 

= Uq®U x . 

Assume that equation (jllj) holds for m > 1 and consider the element 

Uq ■ [Ul, . . . ,U m +l], Ui G M,U m+1 ^ 1. 

If m = 1, then 

<?(it • K, ■ ■ .,Vi]) = g(P x (u 1 -[u 2 ,...,u m+ x])) 

= P x (g(ui ■ [u 2 , ■ ■ ■ , u m +i])) 

= P X (UI <g . . . <g Um+l) 
= 1 (g <g . . . <g U m+ i 
= M <g> Mi (g) . . . (g) M m+ i 

If Uq ^ 1, we have 

g(u ■ [ui, ...,u m+ i]) = g((u -[])o c (l-[u 1 ,...,u m+ i])) 

= g(uo •[])<> g(P x ( u i ■ K, • • • , Um+i])) 

= u Q oP x (g(uf[u 2 ,...,u m+1 \)) 

= U O Px(Ul (g) U 2 (g) • • • <g> «m+l) 
= W o(l ® 1*1 <g • ■ • ® "Um+l) 
= M ® «i ® ■ • • ® « m+ i- 

Thus g is the C-linear map / defined above, proving that / is a morphism in Bax^.. 
Since / is C-linear and sends the C-basis X of 23(A) injectively to the C-linearly 
independent set 
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{u (g> . . . ® u m | m > 0, Ui e M,u m ^ 1}, 
we see that / is injective. ■ 

Proposition 15. II enables us to identify 05(X) as a sub-Baxter algebra of IH^X) in 
the category Bax^,. We further have 

Proposition 5.2. The injective morphism f : 05 (X) — > HTc(X) m Bax^, satisfies the 
following universal property. For any element A in Baxc, also regarded as an element 
in Baxp, and any morphism <fi : 05 (X) — ► A in Bax^ ; there is a unique morphism 
cf) : HIc(X) — > A in Baxc such that <j) o f — <f>. 

Proof. With the notations introduced in the proof of Proposition 15. 1\ we have the 
following diagram 

X ^ 05(X) 

I 9 V i <t> 

m c (X) -U A 

We only need to find <p : IIIc(X) — > A in Bax^ such that the lower right triangle 
commutes. By the universal property of HIc(X) in Bax^, the map <ft o h : X — >• A 
induces a morphism ip : HIc(X) — > A in Bax^ such that ip o g = <fi o h. We then have 

ipofoh = %pog = (j)oh. 
From the universal property of 05 (X) in Bax^., we obtain 

if; o / = (p. 

So we can take <f> — ip. M 

6 The free Baxter C-algebra on C 

As a particular example, we consider IUc(C), the free Baxter C-algebra on C. IIIc(C) 
is also IHc (</>), the free Baxter C-algebra on the empty set 0, defined in Corollary 14.51 
This free Baxter algebra not only provides the simplest example of a free Baxter 
algebra, it also helps to explain an important difference between our construction of 
the free Baxter algebra and the construction of the free Baxter algebra of Rota or 
Cartier. As we see from the previous section, the free Baxter algebras of Rota and 
Cartier are in Bax[l and have no identity. In fact, the free Baxter algebra on <p in 
Baxp is the zero algebra with the zero Baxter operator. 
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If we choose A = C in the construction of free Baxter C-algebras, then we get 

<x> 

m c (C) = 0c^ (n+1) . 

n=0 

Since the tensor product is over C, we have (7®( n+1 ) = Cl®( n+1 \ where l®( n+1 ) = 
lc . . . <8> lq - in particular, l® 1 = lc- Thus IIIc(C) is a free C-module on the basis 

(fi+l)— factors 
l® n ,n > 1. 

Proposition 6.1. For any m,nGN, 

m 

j®(m+l) j®(n+l) _ Y^ m + n ~ fc ^ n ^fc^(g)(ra+n+l-fc) 

' ^ n 

fc=0 

Proof. This is immediate from part 2 of Proposition 13.11 ■ 

We also have the following consequence of Theorem 14.11 

Corollary 6.2. IHc(C) is the initial object in the category Baxc of Baxter C-algebras. 
In other words, for any Baxter C-algebra (R,P), there is a unique Baxter C-algebra 
homomorphism (HIc(C), Pc) (R^P)- 

Now we consider the special case when A = 0. In this case this C-algebra has been 
studied earlier [TH1 §12.3], as the shuffle algebra Sh(C) over C. The following facts 
from there can easily be verified. 

Proposition 6.3. Let A = 0. 

1. The free Baxter C-algebra UIc(C) is a bialgebra. 

2. For any m, neN, l^ m+1 ) o = ( m+n ) 1 ®(™+n+i) > 

n 

Now let HC be the ring of Hurwitz series over C jH], defined to be the set of 
sequences 

{(a n ) | a n e C,n e N} 
in which the addition is defined componentwise and the multiplication is defined by 

On) On) = (C„) 

with 

n 

Cn = ^("Wfon-fc- 
^ k 
k=0 

Denote e n for the sequence (a&) in which a n = lc and = for k ^ n. Since 
e n e m = ( m ^ n )e m+n , from Proposition 16.31 we obtain 
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Proposition 6.4. Let A = 0. The assignment 

!®(n+i) Cnj n > o 

defines an injective homomorphism of Baxter C-algebras from UIc(C) to HC , identi- 
fying Ulc(C) with the subalgebra of "Hurwitz polynomials" 

{(a n ) e HC | a n = 0, n » 0} = a„e„ | a n = 0, n » 0}. 

n>0 
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